A Reordering Approach for Statistical Machine Translation

نویسنده

  • Tin Myat Htwe
چکیده

This paper presents a Markov based hierarchical reordering scheme for lexical reordering to incorporate into phrase-based statistical machine translation system. The goal is to reorder the words and phrases in source language syntactic structure into their corresponding target language syntactic order for making translation easy. Without reordering during language translation, sentences can only be translated properly into a language with similar word order. An effective reordering scheme is essential to model translation between languages with different word orders, such as SVO-languages (English or Chinese) and SOVlanguages (Japanese or Turkish or Myanmar). In earlier work we have shown a phrase based statistical translation model for English Myanmar language translation [21]. In this paper, we focus on reorder scheme for lexical reordering to incorporate into our translation model. For a given source text sentence, reordering performs at three different lexical hierarchical levels; words level where words in each chunk are reordered in target language word order, chunks level where chunks in specific sentence clauses are reordered and clauses level where clauses in the sentence are reordered. All reordering processes will be performed by using reorder Markov model to predicate reordering of neighbor blocks (phrase pairs).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

متن کامل

Novel Reordering Approaches in Phrase-Based Statistical Machine Translation

This paper presents novel approaches to reordering in phrase-based statistical machine translation. We perform consistent reordering of source sentences in training and estimate a statistical translation model. Using this model, we follow a phrase-based monotonic machine translation approach, for which we develop an efficient and flexible reordering framework that allows to easily introduce dif...

متن کامل

Reordering Models for Statistical Machine Translation: A Literature Survey

In this survey, we briefly study various reordering models that are used with statistical translation models. Reordering model is one of the important component of any statistical machine translation system. Problem of reordering is NP-Hard itself. In this survey, we study various reordering approaches that can be used to solve this problem. We first study simple distortion-based reordering whi...

متن کامل

Computing multiple weighted reordering hypotheses for a statistical machine translation phrase-based systems

Reordering is one source of error in statistical machine translation (SMT). This paper extends the study of the statistical machine reordering (SMR) approach, which uses the powerful techniques of the SMT systems to solve reordering problems. Here, the novelties yield in: (1) using the SMR approach in a SMT phrase-based system, (2) adding a feature function in the SMR step, and (3) analyzing th...

متن کامل

Improved chunk-level reordering for statistical machine translation

Inspired by previous chunk-level reordering approaches to statistical machine translation, this paper presents two methods to improve the reordering at the chunk level. By introducing a new lattice weighting factor and by reordering the training source data, an improvement is reported on TER and BLEU. Compared to the previous chunklevel reordering approach, the BLEU score improves 1.4% absolute...

متن کامل

Phrase reordering for statistical machine translation based on predicate-argument structure

In this paper, we describe a novel phrase reordering model based on predicate-argument structure. Our phrase reordering method utilizes a general predicate-argument structure analyzer to reorder source language chunks based on predicate-argument structure. We explicitly model longdistance phrase alignments by reordering arguments and predicates. The reordering approach is applied as a preproces...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007